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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Katz, Leonard 

Stassi, Diane L. 
Summers Jr., Richard G . 
Ruan, Xiaoan 
Pereda- Lopez, Ana 
Kakavas, Stephan J, 

(ii) TITLE OF THE INVENTION: NOVEL POLY KET IDE DERIVATIVES 
AND RECOMBINANT METHODS FOR MAKING SAME 

fiii) NUMBER OF SEQUENCES; 34 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Rd. 

(C) CITY: Abbott Park 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE; FastSEQ Version 2.0 

{vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 16-MAY-1979 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA : » 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Dianne Casuto 

(B) REGISTRATION NUMBER: P-40,943 

(C) REFERENCE /DOCKET NUMBER: 4952 .US. P2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (847) -938-3137 

(B) TELEFAX: ( 847 ) - 93 8 - 2 623 

(C) TELEX: 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGGCCGCTGG CGGTGATGTT CACCGGACAG GGCTCCCAAC GCCCCGGCAT GGGACGACAG 6 0 

TTGTACGAGC ACTTCCCCGT CTTCGCCCAG GCACTGGACG AGGTCTTCGC ACTCGCCACC 12 0 

CCCGGACTAC GCGAGGTGAT GTTCGACCCC GACCAGGCCG AAACACTCCA ACGCACCGAC 18 0 

CACGCCCAGA TCGCCCTGTT CGCCTTCGAA ACCGCCCTCT ACCGACTCTG GGAATCCTGG 24 0 

GGCCTGCGAC CCGACATGGT CTGCGGACAC TCGGTCGGAG AAATCACCGC AGCCCACGTC 3 0 0 

TCCGGCACCC TCACCCTCCC CGACGCCGTC CACCTCGTCA CCACACGCGG CACCCTCATG 3 60 

CAAAACCTGC CCCCCGGCGG CGCCATGCTC GCCGTCGCCA CCGACCCCCA CACCCTCCAA 4 20 

CCCCACCTCG ACAACCACCA CGACACCATC TCCATCGCCG CCATCAACGG CCCCCACGCC 4 B0 

ACCGTCCTCT CCGGCGACCG CACCACCCTC CACCACATCG CCACCCAACT CAACACCAAA 54 0 

CCCTTCACCA CCACCCTCAA CACCCTCACC CACCACCCCC CACACACACC CCTCATCAGC 6 00 

ATGCTCACCG CCACACCCAC CCACCCCGAC ACCACCCACT GGACCCAGCA CATCACCGCA 660 

CCCGTCCGCT ACACCGACAC CCTCCACCAC CTCCACCACC ACGGCATCAC CACCTACCTC 720 

GAAATCGGCC CCGACACCAC CCTCACCGCC CTCGCCCGCA CCACCCTCCC CACCACCACC 7 B0 

CACCTCATCC CCACCACCCG CCGCAACCAC AACGAAGTCC GCAGCACGAA CGAGGCGTTG B4 0 

GGCAGGGTGT TCAGCGTGGG CCACTCGGTG GACTGGCGGG CCCTCACTCC GACCGGGAGG 900 

(XJTACCTCCC TGCCGACGTA CCCCT <)2S 

(2) INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: 

CCTAGGACGG CAGTCCTGCT CACCGGGCAG 
CTGTACGACC GGTCACCGGT GTTCGCCGCC 
GGGCAACTGC CTCGTCCCCT CAAGGACGTT 
GCGCTCATCG ACCGTACGGT GTTCACACAG 
TTCCGGCTGT TCGAGGCCCA CGGCCTCGTC 
GAAGTGACCG CGGCCCACCT GGCCGGGGTC 
GCCCACCGCG GCCGCCTGAT GCAGTCGGCC 
GCGAGCGAGG ACGAGGTACG CGAGGCCCTC 
GGAGTCAACG GCCCGAACGC CACCGTCGTC 
GTCGCGCGCT GGCGCGAGCA GGGCAGGCGG 
CACTCGCCGC ACATGGACGG GATCGTCGAC 
TTCCGCTCCC CGACGATCCC GGTCGTCTCC 
CAGCTGACCT CGCCCGCGTA CTGGGCACGC 
GGGGTGCGGT ACCTGGAGGG CGAGGGCGTC 



SEQ ID NO: 2: 



GGTTCCCAGC 


GTCAGGGCAT 


GGGGCGCGAA 


60 


TCGTTCGACG 


CGATCTGCGC 


TCAACTCGAC 


120 


CTCTTCGCCC 


CCGAGGGGTC 


GGAGGACGCC 


180 


GCGGCTCTGT 


TCGCCGTGGA 


GACCTCCCTG 


240 


CCCGACTACC 


TCATCGGCCA 


CTCCATCGGC 


300 


CTCGATCTGG 


CGGACGCGTG 


CGTCCTGGTC 


360 


CGGGCCGGCG 


CJCGCGATGGC 


CGCGGTCCAG 


4 20 


GCGACCTTCG 


ACGATGCGGT 


TGCCGTGGCC 


480 


TCCGGCGACG 


AGGACGCGGT 


CGAGCGGCTG 


540 


ACGAAGCGGC 


TGCCGGTCAG 


CCACGCCTTC 


600 


GAGTTCGTCA 


CCGCCGTCTC 


CGGGCTCACC 


660 


AACGTCACCG 


GGACCCTCGC 


CACCGTCGAC 


720 


CACATCCGCG 


AGGCCGTGCG 


CTTCGCCGAC 


780 


ACCGAATGGC 


TGGAGCTCGG 


GCCCGACGGC 


040 
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GTTCTCGTCG CCCTGGTCGA GGACTGCCTG GCGAAGGAGG CGGGATCGCT CGCGTCCGCC 900 

CTGCGCAAGG GGGCGAGCGA GCCCCACACC GTGGGCGCGG CCATGGCCCG CGCGGTGCTG 96 0 

CGCGGATCCG GCCCCGACTG GGCGGCGGTG TTCCCCGGCG CACGGCGGGT CGACCTTCCG 1020 

ACGTATGCAT 103 0 

<2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
ATCTACACST CSGGCACSAC SGGCAAGCCS AAGGG 3 5 

(2) INFORMATION FOR SEQ ID NO: 4; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CTSAAGGCSG GCGGCGCSTA CGTSCCSATC GACCC 3 5 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCGAATTCC TAGGCTGGCG GTGATGTTCA 3 0 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 6: 
GCCGGATCCA TGCATACGTC GGCAGGGAGG TAG 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GCTCGAATTC GCTGGTCGCG GTGCACCT 

(2) INFORMATION FOR SEQ ID NO; 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 8 : 
GACGGATCCG GCCCTAGGCT GCGCCCGGCT CG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTGGGATCCT ATGCATTCCA GCGCGhGCGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10 
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GAGAAGCTTG GCGCGACTTG CCCGCT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTTTAAGC TTGGTACCTG CTCACCGGCA ACACCG 

(2) INFORMATION FOR SEQ ID NO; 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTTTTGGAT CCCTGCAGCC TAGGGTCGGA GGCACTGCCG GT 
(2) INFORMATION FOR SEQ ID NO: 13: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
TTTTTTCTGC AGTATGCATT CCAGGGCAAG CGGTTCT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TTTTTTGAAT TCACGCGTTG CCCGCGGCGT AGGCGC 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 34 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
GATCGAATTC CCTAGGACGG CAGTCCTGCT CACC 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GATCGGATCC ATGCATACGT CGGAAGGTCG ACCCG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TTCGAAGAAT TCCCTAGGGT TGCCTTCCTG TTCGAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TTCGAAAAGC TTATGCATAG ACCGGCAGAT CCACCG 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGGTSAAGTC SAACATCGG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

GCRATCTCRC CCTGCGARTG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAGAGAGGAA CCAACGCGCA CGTGATCGTC GAAGAGGCAC CAGC 
(2) INFORMATION FOR SEQ 10^0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GAGAGAGGAT CCGACCTAGG CGCGGAGGTC ACCGGCGCGA CGGCG 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GAGAGACCTA GGAAGCCGGT GTTCGTGTTC CCCGGCCAGG GCT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGAGAGGAT CCGAGGCCGG CCGTGCGCCC GGACCGAAGA CCGCCTC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GAGAGAATTC CCTAGGGTCG CCTTCGTCTT TCCCGGGCAG G 
(2) INFORMATION FOR SEQ ID Np:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:26: 

TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 37 
{2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 3 7 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(D) 


TOPOLOGY : 


linear 










(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 29: 






GCCGACCGTG 


TCGTGTTCGT 


GTTCCCCGGC 


CAGGGCTCGC 


AGTGGGCCGG 


AATGGCCGAG 


60 


GGGCTGCTGG 


AGCGGTCCGG 


CGCGTTCCGG 


AGTGCGGCCG 


ACTCGTGCGA 


CGCCGCGCTG 


120 


CGGCCGTACC 


TCGGCTGGTC 


GGTGCTGAGC 


GTGCTGCGCG 


GGGAACCGGA 


CGCGCCCTCG 


180 


CTCGACCGGG 


TCGACGTCGT 


GCAGCCGGTG 


CTGTTCACGA 


TGATGGTCTC 


GCTCGCGGCG 


240 


GTCTGGCGTG 


CGCTGGGGGT 


GGAACCGGCG 


GCGGTCGTCG 


GGCACTCGCA 


GGGTGAGATC 


300 


GCCGCTGCCC 


ATGTCGCCGG 


TGCGCTGTCG 


CTGGACGACT 


CGGCCCGGAT 


CGTCGCCCTG 


360 


CGCAGTCGGG 


CGTGGCTCGG 


ACTGGCGGGC 


AAGGGCGGCA 


TGGTGGCGGT 


GCCGATGCCG 


420 


GCGGAGGAGC 


TGCGGCCGCG 


GCTGGTGACG 


TGGGGGGACC 


GTCTGGCCGT 


CGCCGCCGTC 


480 


AACAGCCCCG 


GTTCCTGCGC 


CGTCGCAGGC 


GACCCGGAGG 


CGCTGGCCGA 


ACTGGTGGCG 


540 


CTGCTGACCG 


GTGAGGGGGT 


GCACGCCCGG 


CCGATCCCCG 


GCGTCGACAC 


GGCGGGCCAC 


600 


TCGCCGCAGG 


TGGACGCGTT 


GCGGGCTCAT 


CTGCTGGAGG 


TGCTGGCCCC 


GGTCGCCCCC 


660 


CGACCGGCCG 


ACATCCCGTT 


CTACTCGACG 


GTGACCGGCG 


GGCTGCTGGA 


CGGCACCGAG 


720 


CTGGACGCGA 


CGTACTGGTA 


CCGCAACATG 


CGCGAGCCCG 


TCGAGTTCGA 


GCGGGCCACA 


780 


CGGGCGCTGA 


TCGCCGACGG 


GCACGACGTC 


TTCCTGGAGA 


CGAGCCCGCA 


TCCCATGCTG 


840 


GCCGTGGCGC 


TGGAGCAGAC 


GGTCACCGAC 


GCCGGCACCG 


ACGCGGCGGT 


GCTCGGGACC 


900 


CTGCGCCGCC 


GCCACGGCGG 


TCCTCGCGCG 


CTGGCCCTGG 


CCGTCTGCCG 


CGCCTTCGCG 


960 


AGGCGGTCTT 


CGGTCCGGGC 


GCACGGCCCG 


TGGAGTTGCC 


CACCTATQCG 




1010 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



CGCGCGCCTG CCTTCGTCTT TCCCGGGCAG GGCGCCCAGT GGGCCGGACT GGGAGCGCGG 60 

CTCCTCGCGG ACTCCCCCGT CTTCCGCGCC AGGGCCGAGG CATGCGCGCG GGCGCTGGAG 120 

CCTCACCTCG ACTGGTCGGT CCTCGACGTG CTGGCCGGCG CCCCGGGCAC CCCTCCCATC 3 80 

GACCGGGCCG ACGTGGTGCA GCCGGTGCTG TTCACCACGA TGGTCTCGCT GGCCGCCCTC 24 0 

TGGGAGGCCC ACGGGGTGCG GCCGGCCGCG GTCGTGGGCC ACTCCCAGGG CGAGGTGGCC 3 00 

GCGGCCTGCG TGGCCGGTGC CCTGTCGCTG GACGACGCTG CCCTGGTGAT CGCCGGACGC 3 60 

AGCAGGCTGT GGGGGCGGCT GGCCGGGAAC GGCGGGATGC TCGCGGTGAT GGCTCCGGCC 4 20 

GAGCGGATCC GTGAGCTGCT CGAACCATGG CGGCAGCGGA TTTCGGTGGC GGCGGTCAAT 4 80 

GGCCCCGCCT CGGTCACCGT CTCCGGTGAC GCGCTCGCGC TGGAGGAGTT CGGCGCGCGG 54 0 

CTCTCCGCCG AGGGGGTGCT GCGCTGGCCG CTGCCGGGCG TCGACTTCGC CGGCCACTCG 6 00 

CCGCAGGTGG AGGAGTTCCG CGCTGAGCTC CTGGACCTGC TCTCCGGCGT hCGGCCGGCT 66 0 

CCTTCGCGGA TACCTTTCTT CTCCACCGTG ACGGCGGGTC CTTGCGGCGG CGACCAGCTG 72 0 

GACGGGGCGT ACTGGTACCG CAACACGCGC GAACCCGTGG AGTTCGACGC CACGGTCCGG 780 

GCGCTGCTGC GTGCGGGCCA TCACACGTTC ATCGAGGTCG GTCCGCATCC GCTGCTCAAC 84 0 

GCCGCGATCG ACGAGATCGC AGCGGACGAG GGGGTAGCGG CCACGGCCCT GCATACGCTC 900 

CAGCGGGGCG CTGGCGGCCT TGACCGCGTG CGCAACGCGG TGGGCGCCGC TTTCGCGCAC <J6 0 

GGTOTCCGGG TCGACTGGAA CGCCCTGTTC GAGGGCACCG GTGCGCGCAG GGTGCCGCTT 102 0 

CCCTCGTACG CCTTC 1035 



{2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Gly Pro 
1 

Met Gly 

Asp Glu 

Asp Pxo 

50 
Ala Leu 
65 

Gly Leu 



Leu Ala 

Arg Gin 

20 
Val Phe 
35 

Asp Gin 



Phe Ala 
Arg Pro 
Ala Ala His Val 



Val Met 
5 

Leu Tyr 

Ala Leu 

Ala Glu 

Phe Glu 

70 
Asp Met 
85 

Ser Gly 



Phe Thr 

Glu His 

Ala Thr 

40 
Thr Leu 
55 

Thr Ala 
Val Cys 
Thr Leu 



Gly Gin Gly 
10 

Phe Pro Val 

25 

Pro Gly Leu 

Gin Arg Thr 

Leu Tyr Arg 
75 

Gly His Ser 
90 

Thr Leu Pro 



Ser Gin 

Phe Ala 

Arg Glu 

45 
Asp His 
60 

Leu Trp 
Val Gly 
Asp Ala 



Arg Pro Gly 
15 

Gin Ala Leu 
30 

Val Met Phe 

Ala Gin lie 

Glu Ser Trp 
80 

Glu lie Thr 
95 

Val His Leu 
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100 105 110 

Val Thr Thr Arg Gly Thr Leu Met Gin Asn Leu Pro Pro Gly Gly Ala 

115 120 125 

Met Leu Ala Val Ala Thr Asp Pro His Thr Leu Gin Pro His Leu Asp 

130 135 140 

Asn His His Asp Thr He Ser He Ala Ala He Asn Gly Pro His Ala 
145 150 155 160 

Thr Val Leu Ser Gly Asp Arg Thr Thr Leu His His He Ala Thr Gin 

165 170 175 

Leu Asn Thr Lys Thr Aan Trp Leu Asn Val Ser His Ala Phe His Ser 

180 185 190 

Pro Leu Met Gin Pro He Leu Gin Pro Phe Thr Thr Thr Leu Asn Thr 

195 200 205 

Leu Thr His His Pro Pro His Thr Pro Leu He Ser Met Leu Thr Ala 

210 215 220 

Thr Pro Thr His Pro Asp Thr Thr His Trp Thr Gin His He Thr Ala 
225 230 235 240 

Pro Val Arg Tyr Thr Asp Thr Leu His His Leu His His His Gly He 

245 250 255 

Thr Thr Tyr Leu Glu lie Gly Pro Asp Thr Thr Leu Thr Ala Leu Ala 

260 265 270 

Arg Thr Thr Leu Pro Thr Thr Thr His Leu He Pro Thr Thr Arg Arg 

275 280 285 

Asn His Asn Glu Val Arg Ser Thr Asn Glu Ala Leu Gly Arg Val Phe 

290 295 300 

Ser Val Gly His Ser Val Asp Trp Arg Ala Leu Thr Pro Thr Gly Arg 
305 310 315 320 

Arg Thr Ser Leu Pro Thr Tyr Pro 
325 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

<D) TOPOLOGY: linear , 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



Pro Arg Thr Ala Val Leu Leu Thr 

1 5 
Met Gly Arg Glu Leu Tyr Asp Arg 
20 

Asp Ala He Cys Ala Gin Leu Asp 

35 40 
Asp Val Leu Phe Ala Pro Glu Gly 

50 55 
Arg Thr Val Phe Thr Gin Ala Ala 
65 70 
Phe Arg Leu Phe Glu Ala His Gly 



Gly Gin Gly Ser Gin Arg Gin Gly 

10 15 
Ser Pro Val Phe Ala Ala Ser Phe 
25 30 
Gly Gin Leu Pro Arg Pro Leu Lys 
45 

Ser Glu Asp Ala Ala Leu lie Asp 
60 

Leu Phe Ala Val Glu Thr Ser Leu 

75 80 
Leu Val Pro Asp Tyr Leu He Gly 
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His Ser 

Leu Ala 

Ser Ala 
130 
Glu Val 
145 

Gly Val 

Val Glu 

Arg Leu 

Val Asp 
210 
Thr lie 
225 

Gin Leu 

Arg Phe 

Trp Leu 

Cys Leu 
290 
Ala Ser 
305 

Arg Gly 



85 

He Gly Glu Val Thr Ala 
100 

Asp Ala Cys Val 

115 

Arg Ala Gly Gly 



Arg Glu Ala Leu 
150 

Asn Gly Pro Asn 
165 

Arg Leu Val Ala 
180 

Pro Val Ser His 
195 

Glu Phe Val Thr 
Pro Val Val Ser 

230 

Thr Ser Pro Ala 
245 

Ala Asp Gly Val 
260 

Glu Leu Gly Pro 
275 

Ala Lys Glu Ala 



Glu Pro His Thr 
310 

Ser Gly Pro Asp 
325 

Val Asp Leu Pro Thr Tyr 
340 



Leu Val 
120 
Ala Met 
135 

Ala Thr 

Ala Thr 

Arg Trp 

Ala Phe 
200 
Ala Val 
215 

Asn Val 

Tyr Trp 

Arg Tyr 

Asp Gly 
280 
Gly Ser 
295 

Val Gly 
Trp Ala 
Ala 



90 

Ala His 
105 

Ala His 

Ala Ala 

Phe Asp 

Val Val 
170 
Arg Glu 
185 

His Ser 

Ser Gly 

Thr Gly 

Ala Arg 
250 
Leu Glu 
265 

Val Leu 

Leu Ala 

Ala Ala 

Ala Val 
330 



Leu Ala Gly Val 
110 

Arg Gly Arg Leu 
125 

Val Gin Ala Ser 
140 

Asp Ala Val Ala 
155 

Ser Gly Asp Glu 

Gin Gly Arg Arg 
190 

Pro His Met Asp 
205 

Leu Thr Phe Arg 
220 

Thr Leu Ala Thr 
235 

His He Arg Glu 

Gly Glu Gly Val 
270 

Val Ala Leu Val 
285 

Ser Ala Leu Arg 

300 

Met Ala Arg Ala 
315 

Phe Pro Gly Ala 



95 

Leu Asp 

Met Gin 

Glu Asp 

Val Ala 
160 
Asp Ala 
175 

Thr Lys 

Gly He 

Ser Pro 

Val Asp 
240 
Ala Val 
255 

Thr Glu 

Glu Asp 

Lys Gly 

Val Leu 
320 
Arg Arg 
335 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: , ' 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Ala Asp Arg Val 
1 

Gly Met Ala Glu 
20 

Ala Asp Ser Cys 
35 

Leu Ser Val Leu 



Val Phe Val Phe 
5 

Gly Leu Leu Glu 

Asp Ala Ala Leu 
40 

Arg Gly Glu Pro 



Pro Gly Gin Gly 
10 

Arg Ser Gly Ala 
25 

Arg Pro Tyr Leu 
Asp Ala Pro Ser 



Ser Gin Trp Ala 
15 

Phe Arg Ser Ala 
30 

Gly Trp Ser Val 
45 

Leu Asp Arg Val 
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50 55 60 

Asp Val Val Gin Pro Val Leu Phe Thr Met Met Val Ser Leu Ala Ala 
65 70 75 80 

Val Trp Arg Ala Leu Gly Val Glu Pro Ala Ala Val Val Gly His Ser 

85 90 95 

Gin Gly Glu lie Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Asp 

100 105 110 

Asp Ser Ala Arg lie Val Ala Leu Arg Ser Arg Ala Trp Leu Gly Leu 

115 120 125 

Ala Gly Lys Gly Gly Met Val Ala Val Pro Met Pro Ala Glu Glu Leu 

130 135 140 

Arg Pro Arg Leu Val Thr Trp Gly Asp Arg Leu Ala Val Ala Ala Val 
145 150 155 160 

Asn Ser Pro Gly Ser Cys Ala Val Ala Gly Asp Pro Glu Ala Leu Ala 

165 170 175 

Glu Leu Val Ala Leu Leu Thr Gly Glu Gly Val His Ala Arg Pro He 

180 185 " 190 

Pro Gly Val Asp Thr Ala Gly His Ser Pro Gin Val Asp Ala Leu Arg 

195 200 205 

Ala His Leu Leu Glu Val Leu Ala Pro Val Ala Pro Arg Pro Ala Asp 

210 215 220 

He Pro Phe Tyr Ser Thr Val Thr Gly Gly Leu Leu Asp Gly Thr Glu 
225 230 235 240 

Leu Asp Ala Thr Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 

245 250 255 

Glu Arg Ala Thr Arg Ala Leu He Ala Asp Gly His Asp Val Phe Leu 

260 265 270 

Glu Thr Ser Pro His Pro Met Leu Ala Val Ala Leu Glu Gin Thr Val 

275 280 285 

Thr Asp Ala Gly Thr Asp Ala Ala Val Leu Gly Thr Leu Arg Arg Arg 

290 295 300 

His Gly Gly Pro Arg Ala Leu Ala Leu Ala Val Cys Arg Ala Phe Ala 
305 310 315 320 

His Gly Val Glu Val Asp Pro Glu Ala Val Phe Gly Pro Gly Ala Arg 

325 330 335 

Pro Val Glu Leu Pro Thr Tyr Pro 
340 

i 

(2) INFORMATION FOR SEQ ID NO:34; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

<ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Arg Ala Pro Ala Phe Val Phe Pro Gly Gin Gly Ala Gin Trp Ala Gly 

15 10 15 

Leu Gly Ala Arg Leu Leu Ala Asp Ser Pro Val Phe Arg Ala Arg Ala 
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20 25 30 

Glu Ala Cys Ala Arg Ala Leu Glu Pro His Leu Asp Trp Ser Val Leu 

35 40 45 

Asp Val Leu Ala Gly Ala Pro Gly Thr Pro Pro lie Asp Arg Ala Asp 

50 55 60 

Val Val Gin Pro Val Leu Phe Thr Thr Met Val Ser Leu Ala Ala Leu 
65 70 75 80 

Trp Glu Ala His Gly Val Arg Pro Ala Ala Val Val Gly His Ser Gin 

85 90 95 

Gly Glu Val Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Asp Asp 

100 105 110 

Ala Ala Leu Val lie Ala Gly Arg Ser Arg Leu Trp Gly Arg Leu Ala 

115 120 125 

Gly Asn Gly Gly Met Leu Ala Val Met Ala Pro Ala Glu Arg He Arg 

130 135 140 

Glu Leu Leu Glu Pro Trp Arg Gin Arg He Ser Val Ala Ala Val Asn 
145 150 155 160 

Gly Pro Ala Ser Val Thr Val Ser Gly Asp Ala Leu Ala Leu Glu Glu 

165 170 175 

Phe Gly Ala Arg Leu Ser Ala Glu Gly Val Leu Arg Trp Pro Leu Pro 

180 185 190 

Gly Val Asp Phe Ala Gly His Ser Pro Gin Val Glu Glu Phe Arg Ala 

195 200 205 

Glu Leu Leu Asp Leu Leu Ser Gly Val Arg Pro Ala Pro Ser Arg lie 

210 215 220 

Pro Phe Phe Ser Thr Val Thr Ala Gly Pro Cys Gly Gly Asp Gin Leu 
225 230 235 240 

Asp Gly Ala Tyr Trp Tyr Arg Asn Thr Arg Glu Pro Val Glu Phe Asp 

245 250 255 

Ala Thr Val Arg Ala Leu Leu Arg Ala Gly His His Thr Phe He Glu 

260 265 270 

Val Gly Pro His Pro Leu Leu Asn Ala Ala He Asp Glu He Ala Ala 

275 280 285 

Asp Glu Gly Val Ala Ala Thr Ala Leu His Thr Leu Gin Arg Gly Ala 

290 295 300 

Gly Gly Leu Asp Arg Val Arg Asn Ala Val Gly Ala Ala Phe Ala His 
305 310 • 315 320 

Gly Val Arg Val Asp Trp Asn Ala Leu Phe Glu Gly Thr Gly Ala Arg 

325 330 335 

Arg Val Pro Leu Pro Ser Tyr Ala Phe 
340 345 



